
Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(11) 



CM 

CO 
CO 
CO 

o 

Q_ 
LU 



(12) 



EUROPEAN PATENT APPLICATION 



EP 0 833 244 A1 



(43) Date of publication: 

01.04.1998 Bulletin 1998/14 

(21) Application number: 97307077.4 

(22) Date of filing: 12.09.1997 



(51) int. CI. 6 : G06F 7/52, G06F 7/544, 
H03K 19/177, G06F 15/78 



(84) Designated Contracting States: 


(72) Inventor: Perner, Frederick A. 


AT BE CH DE DK ES Fl FR GB GR IE IT LI LU MC 


Palo Alto, CA 94306 (US) 


NL PT SE 






(74) Representative: 


(30) Priority: 26.09.1996 US 718849 


Powell, Stephen David et al 




WILLIAMS, POWELL & ASSOCIATES 


(71) Applicant: 


34 Tavistock Street 


Hewlett-Packard Company 


London WC2E 7PB (GB) 


Palo Alto, California 94304 (US) 





(54) Arithmetic cell for field programmable devices 



(57) An arithmetic cell (100) to be used in field pro- 
grammable devices combines multipliers (200), multipli- 
ers/accumulators (300, 400, 500) and adders (addition, 
compare, and subtraction) in one compact cell. The cell 
(100) may be used in a flexible manner that allows full 
multipliers of any dimension (n*m products), adders of 
any length (n+m sums, compare, differences), accumu- 
lators, and registers (to hold complete results or partial 



products). The cell comprises an application controlled 
multiplexer (111, 112), signal routing (101-107) to pro- 
vide a shift function for multiplication, and a minimum 
collection of configuration bits (CF_1-CF_7, CF_FF, 
CF_ADDR, CF_S}) and circuit elements to perform the 
basic arithmetic functions. 
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Description 

The present invention relates to arithmetic cells, 
e.g. in the form of application specific integrated circuit 
devices and,.' more particularly, to programmable ele- 
ments within field programmable devices. 

Application specific integrated circuits ("ASIC") pro- 
/- vide their users the ability to manufacture products hav- 
ing a proprietarydesign without having to begin design 
at the device level. Many different ASIC technologies 
are available, including gate array, standard cell, full 
custom design and programmable logic devices. The 
gate array, standard cell and full custom technologies 
offer high performance and flexibility, although at the 
expense of a long development cycle and a high non- 
recurring engineering cost. Programmable logic 
devices, which include programmable logic array 
devices ("PLD"), field programmable logic array devices 
("FPLA i '), and programmable gate array devices 
("FPGA"), are useful in a variety of applications such as 
glue logic and simple integrated circuit designs. Their 
utility has been limited in general due to the relatively 
low number of available usable gates, poor flexibility in 
implementing the required logic functions, low speed of 
operation and high power consumption. 

mcicaouii iui nits m emeu uuiuy ui iiGiu pfuyicimma- 
ble devices is generally due to the exponential increase 
in size of the array as the number of logic elements 
increase. The programmable elements in a programma- 
ble array are constructed of active devices such as 
diodes or transistors interconnected by fusible links, 
ultraviolet erasable or electrically erasable floating gate 
devices or active devices controlled by local memory 
circuits. Generally such devices are resistive and high in 
parasitic capacitance, and consume a relatively large 
silicon area. Because each programmable element typ- 
ically consumes a certain amount of power, as the array 
size increases, the power dissipation increases. Moreo- 
ver, the speed decreases with increasing array size, due 
to the increasingly parasitic capacitance of the array. 
Hence, for a given technology, the size of the array is 
limited by practical limitations. 

The utility of field programmable devices has been 
further extended by the technique of a programmable 
interconnect network for devices consisting of small pro- 
grammable logic functional units. A typical field pro- 
grammable device has an array of programmable logic 
functional units. Each functional unit performs one or 
more logical functions. Vertical and horizontal lines are 
permanently connected to the inputs and similarly to the 
outputs of each functional unit. The horizontal and verti- 
cal lines cross at various areas of the device to form pro- 
grammable interconnection matrices. Programming 
may be by mask, conductive fuses, E P ROM/E E P ROM . 
switching transistors, or normal logic transistors con- 
nected to programming memory elements. Other types 
of functional units such as random access memories 
and arithmetic logic units may be present. 



Furthermore, existing field programmable devices 
have attempted to include features in their field pro- 
grammable cells to enhance the arithmetic operation of 
addition. These devices contain four-input logic tables, 

5 flip-flops, carry logic, and configuration switches, as well 
as circuitry for implementing adders and logic that may 
include carry signals to nearest neighbor ceils. Thus, 
basic arithmetic functions may be implemented as gen- 
eral logic in these field programmable devices. Again, 

w however, although arithmetic functions can be found in 
many applications, these functions tend to fit poorly into 
current field programmable devices further reducing 
density and performance. 

Moreover, there is currently no mechanism that can 

is be used in a FPLA to efficiently perform multiplication. 

Thus, it can be seen that poor circuit density and 
performance limits of current technology field program- 
mable devices limits the use of these devices in many 
applications. 

20 Therefore, there is an unresolved need for a field 
programmable device that will significantly improve the 
density and performance of applications that use com- 
mon arithmetic functions. 

The invention is a method and apparatus that pro- 
ps vides a field programmable device with an arithmetic 
cell that significantly improves the density and perform- 
ance of applications that use common arithmetic func- 
tions. Within the cell, application accessible 
configuration switches and bit shifting circuits are added 
30 to create an efficient basic arithmetic cell to be used as 
a programmable element in a field programmable 
device. Furthermore, multiplication is enhanced through 
the addition of two multiplexers and cell routing to allow 
a data shift. 

35 Thus, an arithmetic cell to be used in field program- 
mable devices is defined in this invention. This arithme- 
tic cell permits efficient implementation of multipliers, 
multipliers/accumulators, and adders (addition, com- 
pare, and subtraction) in one compact cell that is a col- 

40 lection of circuits common to field programmable 
devices. This arithmetic cell is used in a flexible manner 
that permits implementation of full multipliers of any 
dimension (n*m products), adders of any length (n+m 
sums, compare, differences), accumulators and regis- 

45 ters (to hold complete results or partial products). Key 
elements in this invention are an application controlled 
multiplexer, signal routing to provide a shift function for 
rriul'dplication, and a minimum collection of configuration 
bits and circuit elements to perform the basic arithmetic 

so functions. 

The invention will be readily understood by the fol- 
lowing detailed description in conjunction with the 
accompanying drawings, wherein like reference numer- 
als designate like structural elements, and in which: 

55 

Figure 1 is a circuit schematic of a configurable 
arithmetic cell for a field programmable device 
wherein the cell is illustrative of a cell constructed 
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according to the present invention; 
Figure 2 is a circuit schematic of a 3-by-3 matrix of 
arithmetic ceils of the type represented in Figure 1 
wherein the cells are configured to provide a multi- 
plier circuit; ■ 5 
Figure 3 is a circuit schematic of a 4-by-1 matrix of 

. arithmetic cells of the type represented in Figure 1 
wherein the cells are configured to provide an accu- 
mulator circuit; 

; Figure 4 is a circuit schematic of a 3-by-3 matrix of 10 
arithmetic cells of the type represented in Figure 1 
wherein ttie cells are configured to provide a 2 bit 
multiplier accumulator circuit; and 
Figure 5 is a circuit schematic of a 5-by-5 matrix of 
arithmetic cells of the type represented in Figure 1 15 
wherein the cells are configured to provide a 4 bit 
multiplier accumulator circuit. 

Embodiments of the invention are discussed below 
with reference to Figures 1 -5. Those skilled in the art will 20 
readily appreciate that the detailed description given 
herein with respect to these figures is for explanatory 
purposes, however, because the invention extends 
beyond these limited embodiments. 

Figure 1 is a circuit schematic of a configurable 25 
arithmetic cell 100 for a field programmable device 
wherein the cell 100 is an illustrative embodiment of a 
cell * constructed according to the present invention. 
Arithmetic cell 100 includes steering logic 101-107, fJip 
flop (FF) circuit 1 08, a 1 6 bit look-up table (LUT) conf ig- 30 
ured as an adder (ADDR) circuit 109, switch 110 and 
multiplexer circuits 111-112 (M1-M2). Each of the ele- 
ments 1 01 -1 1 2 of cell 1 00 are representative of classes 
of known components and therefore further discussion 
of these elements will be limited to describing how they 35 
operate within the context of the arithmetic cell of the 
present invention. 

For one embodiment of cell 100, ADDR 109 is a full 
adder circuit having input and output signals. Signals A, 
B, C_IN and CF_ADDR are input signals wherein sig- 40 
nals A, B and CJN are respective first, second, and 
third addend signals and signal CF_ADDR is a 36 bit 
configuration signal for an 8-by-2 look-up table (LUT) 
that implements the adder circuit. Signals R and 
C_OUT are output signals wherein signal R is the result 45- 
signal (i.e., the sum of addend signals A, B and carry 
input signal C_IN) and wherein signal C_OUT is the 
carry output signal representing the carry, if any, formed 
when calculating result signal R. 

Similarly, for one embodiment of cell 100, flip flop so 
(FF) 1 08 is a D-type flip flop circuit having input and out- 
put signals. Signals IN, CLK, RESET and CF_FF are 
input signals wherein signal IN is an input signal, repre- 
senting the datum to be latched in FF 108. signals CLK 
and RESET are respective clock and reset signals for ss 
FF 108 and signal CF_FF is a configuration signal for 
the flip flop circuit. Signal OUT is an output signal indi- 
cating the datum value latched in flip flop 1 08. 



In arithmetic cell 100, steering logic<101-107 have 
respective configuration signals CF_1-CF^7.^Similarly, 
as will be explained below in greater detail, M1 and M2 
(i.e., respective multiplexers 1 1 1 and 112) are also xonr 
trolled by control signals. Configuration signals CF_TK 
CF_7, CF_ADDR and CF_FF control the configuration 
of arithmetic cell 100 and permit cell arrays to /provide 
flexible field programmable devices.* Thus, cell 1 00 will 
permit efficient implementations of multipliers, multipli- 
ers/accumulators, and adders (addition, compare, and 
subtraction) in one compact cell that is a collection of 
circuits common to field programmable devices. This 
cell may be used in a flexible manner that allows full 
multipliers of any dimension (n*m products), adders of 
any length (n+m sums, compare : differences), accumu- 
lators, and registers (to hold complete results or partial 
products). Key elements in this invention are an applica- 
tion controlled multiplexer, signal routing to provide a 
shift function for multiplication, and a minimum collec- 
tion of configuration bits and circuit elements to perform 
the basic arithmetic functions. 

Thus, figure 1 is circuit schematic of an exemplary 
basic arithmetic cell , of this invention. The elements 
ADDR 109, FF 1.08, and steering logic 101-107 are 
common field programmable cell components. Multi- 
plexers 111 and 112 (M1 and M2) and switch 110 are 
introduced as a key parts of this invention to permit this 
adder cell to function as an efficient multiplier. Further- 
more, the configuration of the other components is set 
to give this cell the flexibility to function as any part of a 
regular array circuit that will perform multiplication as 
well as addition or accumulate functions. Herein, "accu- 
mulate" is defined as the circuit function of adding a 
number to the contents of a register. , 

Multiplication is performed by testing the active bit 
of the multiplier and performing the multiplication opera- 
tion differently based upon the value of the active bit. If 
the active bit of the multiplier is a "1", the multiplicand is 
added to a partial product and the resultant partial prod- 
uct is then shifted to the next row and one column over 
in the the multiplier array. However, if the active bit of the 
multiplier is a "0", the resultant partial product is simply 
shifted down one row and over one column. In cell 100, 
multiplexers 111 and 112 (M1 and M2) perform the test 
on the active bit of the multiplier. Routing within the 
arithmetic cell then performs the partial product shift. 

One example of the application of the arithmetic cell 
100 is a 3-by-3 multiplier. Figure 2 is a circuit schematic 
of a 3-by-3 multiplier comprised of a 3-by-3 matrix of 
arithmetic cells 100 shown in Figure 1 . A 3-bit multiplier 
signal (B2;B0) enters the "right" edge of the array (cells 
200:202), ,a 3-bit multiplicand signal (A2:A0) enters the 
"top" of the array (cells 220, 210 and 200), and a prod- 
uct signal (R5:R0) exits the "right" edge and the "bot- 
tom" of the array (cells 200202, 212, and 222). The" 
top" row .of cells (cells 220, 210 and 200) introduce a "0" 
initial, partial product to the array with configuration bits 
(CF_4, CF_5, and CF_6 of Figure 1). Cell 221 shifts the 
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carry out from its adder "AD" to the most-significant-bit 
of the partial product of the next row of ceils by setting 
the configuration bit CF_S to bias the transistor 1 10 to a 
low imp y edance state. Cell 222 shifts the carry out from 
it^acider "AD" to the most- significant-bit of the product 
^/-{bit R5) by setting the configuration bit CF_S to bias the 
transistor 110 to a low impedance state. The multiplier 
may operate in an asynchronous mode or be broken 
into pipeline stages by using the registers in each cell 
(FF 108 of Figure 1) to store the partial products. The 
multiplier depicted in Figure 2 is shown as an asynchro- 
nous multiplier. 

Thus, a direct result of the configuration of the arith- 
metic cell 1 00 (Figure 1) is a rectangular "n'nrT multiplier 
configuration where the "n" bits of the multiplicand are 
vertically aligned with the "n" most-significant-bits of the 
product and the "m" multiplier bits form rows of the mul- 
tiplier array and are aligned with the "m" least-signifi- 
cant-bits of the product. This circuit structure is highly 
compatible with field programmable devices for defining 
efficient and flexible multiplier and adder arrays. 

A second example of the application of the arithme- 
tic cell is a 4 bit accumulator (Figure 3). The accumula- 
tor function is added to the basic adder circuit by using 
configuration bits CF_4, CF_5 and CF_6 to set respec- 
tive multiplexer steering logic 104, 1 05. and 106 so as to 
route the output of the flip-flop (FF) 108 back to input B 
of the 1 bit adder (ADDR) 104. 

The third example of the application of the arithme- 
tic cell 100 is a 2 bit MAC (Figure 4, 2 bit Multiplier Accu- 
mulator). In Figure 4, cells 410, 420, 411 and 421 are 
configured to form a 2 bit multiplier in a manner similar 
to that illustrated in the 3 bit case of Figure 2. Cells 412 
and 422 are configured to form a 2 bit accumulator in a 
manner similar to that illustrated in the 4 bit case of Fig- 
ure 3. In a similar manner to the "horizontal" accumula- 
tor cells of the two high order result bits , cells 400 and 
401 are configured to form a 2 bit "vertical" accumulator 
for the two low order result bits (R0:R1). 

Figure 5 is a circuit schematic of a 5-by-5 matrix of 
arithmetic cells of the type represented in Figure 1 
wherein the cells are configured to provide a 4 bit multi- 
plier circuit. Thus, Figure 5 illustrates how the 2 bit MAC 
of Figure can be expanded to form a 4 bit MAC. In Fig- 
ure 5, cells 510, 520, 530, 540, 51 1 , 521, 531, 541 , 512, 
522, 532, 542, 513, 523, 533 and 543 are configured to 
form a 4 bit multiplier in a manner similar to that illus- 
trated in the 3 bit case of Figure 2. Cells 514, 524, 534 
and 544 are configured to form a 4 bit accumulator in a 
manner similar to that illustrated in Figure 3. In a similar 
manner to the "horizontal" accumulator cells of the four 
high order result bits , cells 500, 501, 502 and 503 are 
configured to form a 4 bit "vertical" accumulator for the 
four low order result bits (R0:R3). 

Thus it can be seen that an advantage of this inven- 
tion is to provide an array building block for field pro- 
grammable devices that permits an efficient 
implementation of multipliers and other basic arithmetic 



functions. The two multiplexers shown in Figure 1 as M1 
and M2 are efficiently included in the arithmetic cell thus 
eliminating the need to consume a general logic cell in 
existing field programmable devices. The collection of 
5 programmable devices in the arithmetic cell is mini- 
mized to efficiently perform the functions: 

addition A + B = R 
compare A =? B 
io subtraction A-B = RorB-A=R 
accumulate A + R(n-l) = R(n) 
multiply A * B = R 

multiply-accumulate A(n)*B(n) + R(n-1) = R(n) 

15 Physical layout of the arithmetic cell is estimated to 
consume about one quarter of a general logic cell. The 
cell compaction and increased function increases the 
effective cell density by a factor of 10. Performance is 
improved by reducing intercell wire capacitance found in 

20 general logic cells and by reducing the number of series 
configuration switches that must be used to implement 
arithmetic functions from general logic cells. 

Another advantage of the arithmetic cell is the gen- 
eral nature of the cell that allows for building mixtures of 

25 n*m multipliers, adders, registers, and accumulators 
within an homogeneous array of the arithmetic ceiis. 
The collection of wires, components, and configuration 
bits are sufficient to build complete arithmetic circuits 
without requiring the use of general logic cells. 

30 The many features and advantages of the invention 
are apparent from the written description and thus it is 
intended by the appended claims to cover all such fea- 
tures and advantages of the invention. Further, because 
numerous modifications and changes will readily occur 

35 to those skilled in the art, it is not desired to limit the 
invention to the exact construction and operation as 
illustrated and described. Hence, all suitable modifica- 
tions and equivalents may be resorted to as falling 
within the scope of the invention. 

40 

Claims 

1. An arithmetic cell [100] for a programmable device, 
the cell [100] comprising: 

45 t 

- an adder [109]; 

a first multiplexer [111]; 
a second multiplexer [1 12]; and 
a steering logic [101-107] configurable to per- 
50 form a multiplication operation, 

wherein the first and second multiplex- 
ers [1 1 1 , 112] cooperate to perform a test on 
an active bit of a multiplier input to the adder, 
and rf the active bit of the multiplier is a "1", a 
55 multiplicand input to the adder is added to a 

partial product and the resultant partial product 
is then shifted, 

however, if the active bit of the multiplier 
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is a "0", the resultant partial, product is shifted 
without adding. 

2. The arithmetic cell [100] as set forth in claim 1, 
wherein the adder [109] is a look-up table config- s 
ured as an adder. 

3. The arithmetic cell [100] as set forth in claims 1 or 

2, further comprising aflip-flop [108] configurable to 
latch the result from the adder [109] and thereby 10 
perform a multiply-accumulale operation. 

4. The arithmetic cell [100] as set forth in claim 3, 
wherein the steering logic [101-107] can be config- 
ured to perform accumulation in the flip-flop [108] is 
without multiplication. 

5. The arithmetic cell [100] as set forth in claim 3, 
wherein the steering logic [101-107] and the adder 
[109] can be configured to perform latching in the 20 
flip-flop [108] without multiplication and without 
accumulation. 

6. The arithmetic cell [100] as set forth in claim 1 , 2 or 

3, wherein the steering logic [101-107] and the 25 
adder [109] can be configured to perform addition. 

7. The arithmetic cell [100] as set forth in claim 1 , 2 or 
3, wherein the steering logic [101-107] and the 
adder [1 08] can be configured to perform compari- 30 
son. 

8. The arithmetic cell [100] as set forth in claim 1 , 2 or 
3, wherein the steering logic [101-107] can be con- 
figured to perform subtraction. . 35 

9. A programmable device, the programmable device 
comprising an array of cells wherein at least one of 
the cells is an arithmetic cell [100] as set forth in 
claims 1 , 2, 3, 4, 5, 6, 7 or 8. 40 

10. A method of performing multiplication within an 
arithmetic cell [100] for a programmable device, the 
arithmetic cell including an adder [108], first and 
second multiplexers [111, 112] and a configurable 45 
steering logic [101-107], the method comprising the 
steps of: 

configuring the steering logic [101 -107] to per- 
form the multiplication operation; and so 
using the first and second multiplexers [111. 
112] to cooperatively perform a test on an 
active bit of a multiplier input to the adder [108]; 
if the active bit of the multipliar is a "1 ", a multi- 
plicand input to the adder [108] is added to a ss 
partial product and the resultant partial product 
is then shifted; 

however, if the active bit of the multiplier is a 
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"0", the resultant partial product is shifted with- 
out adding. 
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